Skip to main content

Internet Archive Web Crawls

The Internet Archive discovers and captures web pages through many different web crawls. At any given time several distinct crawls are running, some for months, and some every day or longer. View the web archive through the Wayback Machine.

1,133,057
RESULTS
rss


Media Type
110
collections
1,131,415
web
1,532
data
Year
60,561
2018
204,610
2017
167,853
2016
139,882
2015
100,829
2014
138,510
2013
More right-solid
Topics & Subjects
991,536
crawldata
2,263
no404
1,452
wikipedia
811
wordpress
252
amazonbooks
7
end of term
More right-solid
Collection
More right-solid
Creator
984,169
internet archive
73,846
internetarchive
32,949
alexa internet
1,275
university of north texas
31
google, inc.
3
lekash@archive.org
More right-solid
Language
3,915
English
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
.com survey started January 2011
collection
2,535
ITEMS
158.7M
VIEWS
collection
eye 158.7M
Survey crawl of .com domains started January 2011.
Topic: webcrawl
2004 Election
collection
178
ITEMS
5.8M
VIEWS
collection
eye 5.8M
2004 Election crawl performed by Internet Archive. This data is currently not publicly accessible.
2004 Indian Ocean earthquake and tsunami
collection
42
ITEMS
2.8M
VIEWS
collection
eye 2.8M
Data related to the 2004 Indian Ocean earthquake and tsunami collected by Internet Archive. This data is currently not publicly accessible.
Host Screen Captures
by lekash@archive.org
data
eye 129
favorite 0
comment 0
Host Screen Captures
by lekash@archive.org
data
eye 137
favorite 0
comment 0
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Mon Dec 12 19:03:05 PST 2016 to Mon Dec 12 12:01:41 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Mon Dec 12 19:52:12 PST 2016 to Mon Dec 12 13:25:03 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Mon Dec 12 21:33:43 PST 2016 to Mon Dec 12 14:39:46 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Mon Dec 12 22:33:37 PST 2016 to Mon Dec 12 16:42:11 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 00:44:01 PST 2016 to Mon Dec 12 17:58:44 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 02:19:17 PST 2016 to Mon Dec 12 19:25:50 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 03:24:10 PST 2016 to Mon Dec 12 19:54:15 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 03:51:45 PST 2016 to Mon Dec 12 20:15:22 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 04:07:16 PST 2016 to Mon Dec 12 20:36:37 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 04:32:44 PST 2016 to Mon Dec 12 21:01:17 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 04:49:59 PST 2016 to Mon Dec 12 21:15:30 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 05:07:49 PST 2016 to Mon Dec 12 21:30:46 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 05:27:16 PST 2016 to Mon Dec 12 22:06:44 PST 2016.
Topic: crawldata
Internet Archive crawldata from End of Term 2016, captured by wbgrp-crawl005.us.archive.org:EOT-2016 from Tue Dec 13 05:44:52 PST 2016 to Mon Dec 12 22:26:57 PST 2016.
Topic: crawldata
Topic: crawldata
Topic: crawldata
Topic: crawldata
Topic: crawldata
Topic: crawldata
Topic: crawldata
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
No reports found for crawljob: newscrawl
Topic: crawldata
Amazon Crawl
collection
2
ITEMS
250
VIEWS
collection
eye 250
Crawl of Amazon. These files are currently not publicly accessible.
Amazon Crawl
web
eye 48
favorite 0
comment 0
A collection of government and military FTP sites captured automatically for backup. In this item: casino.aoml.noaa.gov ftpprd.ncep.noaa.gov gimms.gsfc.nasa.gov goldsmr4.sci.gsfc.nasa.gov is.sci.gsfc.nasa.gov ldas3.ncep.noaa.gov measures.ecs.nasa.gov oco2.gesdisc.eosdis.nasa.gov public-ftp.agl.faa.gov pwgdata.gsfc.nasa.gov ww4.dnr.wa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov
A collection of government and military FTP sites captured automatically for backup. In this item: aftp.cmdl.noaa.gov